53 research outputs found

    New perspectives on the performance of machine learning classifiers for mode choice prediction: An experimental review

    Get PDF
    It appears to be a commonly held belief that Machine Learning (ML) classification algorithms should achieve substantially higher predictive performance than manually specified Random Utility Models (RUMs) for choice modelling. This belief is supported by several papers in the mode choice literature, which highlight stand-out performance of non-linear ML classifiers compared with linear models. However, many studies which compare ML classifiers with linear models have a fundamental flaw in how they validate models on out-of-sample data. This paper investigates the implications of this issue by repeating the experiments of three past papers using two different sampling methods for panel data. The results indicate that using trip-wise sampling with travel diary data causes significant data leakage. Furthermore, the results demonstrate that this data leakage introduces substantial bias in model performance estimates, particularly for flexible non-linear classifiers. Grouped sampling is found to address the issues associated with trip-wise sampling and provides reliable estimates of true Out-Of-Sample (OOS) predictive performance. Whilst the results from this study indicate that there is a slight predictive performance advantage of non-linear classifiers over linear Logistic Regression (LR) models, this advantage is much more modest than has been suggested by previous investigations

    From domestic energy demand to household activity patterns

    Get PDF
    Residential building energy usage can be considered as being derived from the activity patterns of individuals inside the home. As such an activity-based energy demand model that can create in-home energy usage profiles from household activity patterns is the key to a better building energy demand analysis. In order to find the relation between building energy usage and activity profiles, energy usage data with an overlapping activity diary survey is needed. However, there is no detailed data containing information on both household activity schedules and energy usage. Therefore, utilizing a Bayesian approach, we explore the possibility of reverse engineering to get the household activity patterns from energy usage profiles. The findings can be further used for linking the domestic energy demand to the activity schedules of the occupants

    Integrated in- and out-of-home scheduling framework: A utility optimization-based approach

    Get PDF
    Existing activity-based modeling predominantly focus on out-of-home activities in order to understand transport demand. In this research, we extend the state of practice in activity-based modelling by determining both in- and out-of-home activities in a single scheduling framework. This approach has two main benefits: Firstly, it can capture the trade-offs between in-home and out-of-home activities. Secondly, in-home time-use patterns can be used to model high resolution energy demand. Our work builds on an existing optimisation framework, which treats individuals as maximising their total utility from completed activities and incorporates multiple scheduling decisions simultaneously. The approach is tested on a set of detailed daily schedules extracted from the the 2016-2020 UK Time Use Survey data. The results show that the model is able to generate peoples’ daily activity schedules based on their individual preferences and constraints

    From one-day to multiday activity scheduling: Extending the OASIS framework

    Get PDF
    Applications of activity-based models for the estimation of transport demand have demonstrated to achieve greater behavioural realism than traditional trip-based models. However, state-of-the art models focus on single-day schedules as focal points to their estimations, thus ignoring fundamental dynamics that explain individual behaviour over longer periods of time. Several authors have highlighted the importance of multiday analyses in activity-travel contexts, which are still lacking in many state-of-the-art framework. In this paper, we present an extension of the OASIS framework, an integrated model for the simulation of single day schedules, to include intrapersonal interactions influencing longer term decisions. We formulate the multiday problem as a multiobjective optimisation problem where each day d is associated with a utility Ud. We consider an activity-based set up where individuals maximise the total utility of their schedules over multiple days (e.g. week). We discuss implications and requirements of this formulation, and illustrate the methodology with chosen examples

    Integrated in- and out-of-home scheduling framework: A utility optimization-based approach

    Get PDF
    Existing activity-based modeling predominantly focus on out-of-home activities in order to understand transport demand. In this research, we extend the state of practice in activity-based modelling by determining both in- and out-of-home activities in a single scheduling framework. This approach has two main benefits and applications: Firstly, it can capture the trade-offs between in-home and out-of-home activities. Secondly, in-home time-use patterns can be used to model high resolution energy demand, which can contribute to demand side management. Our work builds on an existing optimisation framework, which treats individuals as maximising their total utility from completed activities and incorporates multiple scheduling decisions simultaneously. The framework has been extended to determine the choice of location for activities such as work, study, and leisure. The approach is tested on a set of detailed daily schedules extracted from the 2016-2020 UK Time Use Survey data. The results show that the model is able to generate peoples’ daily activity schedules based on their preferences and constraints

    Simulating intra-household interactions for in- and out-of-home activity scheduling

    Get PDF
    Various interactions, time arrangements, and constraints exist for individuals scheduling their day as a member of a household, which affect their in-home as well as out-of-home activity schedule. However, the existing activity-based models are mostly based on the individual decision-making process, which are limited in their demonstration of behaviour. We simulate multiple intra-household interaction dimensions within the same framework and capture the coordination of the activity scheduling decisions among all household members. Our approach adopts the Optimisation-based Activity Scheduling Integrating Simultaneous choice dimensions (OASIS) framework, which is at the level of isolated individuals and focuses on out-of-home activity schedules. We jointly simulate in- and out-of-home activities and incorporate interactions into the framework. Our framework contributes to the state-of-the-art in activity-based modelling by explicitly capturing multiple interactions within the same model, such as the allocation of the private vehicle to household members, dividing household maintenance responsibilities, escorting, joint activity participation, and sharing rides. We operationalise the model using time-use-survey data from the United Kingdom. The simulation results demonstrate the ability of the framework to capture complex intra-household interactions. We then demonstrate how these interactions can cause individuals to deviate from their schedules planned in isolation. This is a general framework applicable to different household compositions and available resources

    Choice set generation for activity-based models

    Get PDF
    Activity-based models have seen a significant increase in research focus in the past decade. Based on the fundamental assumption that travel demand is derived from the need to do activities and time and space constraints (Hägerstraand, 1970, Chapin, 1974). ABM offer a more flexible and behaviourally centred alternative to traditional trip-based approaches. Econometric — or utility-based — activity-based models (e.g., Adler and Ben-Akiva, 1979, Bowman and BenAkiva, 2001) postulate that the process of activity generation and scheduling can be modelled as discrete choices. Individuals derive a utility from performing activities, and they schedule them as to maximise the total utility of the schedule. In classical discrete choice model applications, the parameters of the utility functions can be estimated by deriving their maximum likelihood estimators. As the likelihood function is defined over a full enumeration of the alternatives in the choice set, this approach is limited for activity-based applications: the set of possible activities and their spatio-temporal sequence is combinatorial and not fully observed by either the decision-maker or the modeller. While discrete choice models can be estimated over samples of alternatives (e.g., Guevara and Ben-Akiva, 2013) an appropriate definition of such sample is as crucial as it is challenging. This paper presents a methodology to sample a choice set of full daily schedules for a given individual and a list of activities. The Metropolis-Hastings algorithm allows us to explore the space efficiently and draw both high and lower probability alternatives for consistent estimation of the parameters. The methodology is tested on a sample of individuals from the 2015 Swiss Mobility and Transport Microcensus (Office fédéral de la statistique and Office fédéral du développement Territorial, 2017)

    Integrated models of transport and energy demand: A literature review and framework

    Get PDF
    Energy and transport demand can both be considered as being derived from an individual’s activity participation. As such, both energy and transport demand are inherently linked: completing activities inside the home generates residential energy demand, where completing activities outside the home generates transportation and non-residential energy demand. Whilst there are several works in the literature that focus on either energy or transportation demand, there remain very few studies which explicitly investigate their interaction. To address this need, in this paper we conduct in-depth literature review of transportation and energy demand modeling. The review analyses the methodologies employed within each domain in order to (a) establish the state-of-research for energy demand modeling and (b) identify the suitable opportunities for joining these two domains. Drawing on a review of the current papers, we identify four key areas of practice: (i) activity scheduling, (ii) building energy demand, (iii) transportation energy demand, and (iv) the integration of components. Finally, based on the findings from the review, we propose a new framework for joint building and transportation energy demand modeling at an urban scale

    OASIS: Optimisation-based Activity Scheduling with Integrated Simultaneous choice dimensions

    Get PDF
    Activity-based models offer the potential of a far deeper understanding of daily mobility behaviour than trip-based models. However, activity-based models used both in research and practice have often relied on applying sequential choice models between subsequent choices, oversimplifying the scheduling process. In this paper we introduce OASIS, an integrated framework to simulate activity schedules by considering all choice dimensions simultaneously. We present a methodology for the estimation of the parameters of an activity-based model from historic data, allowing for the generation of realistic and consistent daily mobility schedules. The estimation process has two main elements: (i) choice set generation, using the Metropolis-Hasting algorithm, and (ii) estimation of the maximum likelihood estimators of the parameters. We test our approach by estimating parameters of multiple utility specifications for a sample of individuals from a Swiss nationwide travel survey, and evaluating the output of the OASIS model against realised schedules from the data. The results demonstrate the ability of the new framework to simulate realistic distributions of activity schedules, and estimate stable and significant parameters from historic data that are consistent with behavioural theory. This work opens the way for future developments of activity-based models, where a great deal of constraints can be explicitly included in the modelling framework, and all choice dimensions are handled simultaneously

    DATGAN: Integrating expert knowledge into deeplearning for population synthesis

    Get PDF
    Agent-based simulations and activity-based models used to analyse nationwide transport networks require detailed synthetic populations. These applications are becoming more and more complex and thus require more precise synthetic data. However, standard statistical techniques such as Iterative Proportional Fitting (IPF) or Gibbs sampling fail to provide data with a high enough standard, e.g. these techniques fail to generate rare combinations of attributes, also known as sampling zeros in the literature. Researchers have, thus, been investigating new deep learning techniques such as Generative Adversarial Networks (GANs) for population synthesis. These methods have already shown great success in other fields. However, one fundamental limitation is that GANs are data-driven techniques, and it is thus not possible to integrate expert knowledge in the data generation process. This can lead to the following issues: lack of representativity in the generated data, the introduction of bias, and the possibility of overfitting the sample’s noise. To address these limitations, we present the Directed Acyclic Tabular GAN (DATGAN) to integrate expert knowledge in deep learning models for synthetic populations. This approach allows the interactions between variables to be specified explicitly using a Directed Acyclic Graph (DAG). The DAG is then converted to a network of modified Long Short-Term Memory (LSTM) cells. Two types of multi-input LSTM cells have been developed to allow such structure in the generator. The DATGAN is then tested on the Chicago travel survey dataset. We show that our model outperforms state-of-the-art methods on Machine Learning efficacy and statistical metrics
    • …
    corecore